Non-filter waveform generation from cepstrum using spectral phase reconstruction

نویسندگان

  • Yasuhiro Hamada
  • Nobutaka Ono
  • Shigeki Sagayama
چکیده

This paper discusses non-filter waveform generation from cepstral features using spectral phase reconstruction as an alternative method to replace the conventional source-filter model in text-to-speech (TTS) systems. As the primary purpose of the use of filters is considered as producing a waveform from the desired spectrum shape, one possible alternative of the sourcefilter framework is to directly convert the designed spectrum into a waveform by utilizing a recently developed“ phase reconstruction”from the power spectrogram. Given cepstral features and fundamental frequency (F0) as desired spectrum from a TTS system, the spectrum to be heard by the listener is calculated by converting the cepstral features into a linear-scale power spectrum and multiplying with the pitch structure of F0. The signal waveform is generated from the power spectrogram by spectral phase reconstruction. An advantageous property of the proposed method is that it is free from undesired amplitude and long time decay often caused by sharp resonances in recursive filters. In preliminary experiments, we compared temporal and gain characteristics of the synthesized speech using the proposed method and mel-log spectrum approximation (MLSA) filter. Results show the proposed method performed better than the MLSA filter in the both characteristics of the synthesized speech, and imply a desirable properties of the proposed method for speech synthesis.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech synthesis using a maximally decimated pseudo QMF bank for embedded devices

A fast speech waveform generation method using a maximally decimated pseudo quadrature mirror filter (QMF) bank is proposed. The method is based on subband coding with pseudo QMF banks, which is also used in MPEG Audio. In the method, subband code vectors for speech sounds are synthesized from magnitudes of spectral envelope and fundamental frequencies for periodic frames, and then waveforms ar...

متن کامل

Advances in Spectral Parameterization for Statistical (HMM-Based) TTS

HMM-based parametric speech synthesis has recently become an alternative to the concatenative TTS approach, especially when low footprint and general speech domain are required. A majority of speech parameterization models used in state-ofthe art HMM TTS systems employ source-filter waveform synthesis schemes. Sinusoidal representation and waveform generation of speech is an alternative to the ...

متن کامل

Speech waveform synthesis from MFCC sequences with generative adversarial networks

This paper proposes a method for generating speech from filterbank mel frequency cepstral coefficients (MFCC), which are widely used in speech applications, such as ASR, but are generally considered unusable for speech synthesis. First, we predict fundamental frequency and voicing information from MFCCs with an autoregressive recurrent neural net. Second, the spectral envelope information conta...

متن کامل

Instantaneous frequency estimation and order tracking based on Kalman filters

Order tracking (OT) is a signal processing technique that allows capturing the dynamics of the vibration signals measured from rotating machines. In this paper, its propose an OT scheme based on extended Kalman filter (EKF), which comprises the advantages of existing OT techniques. Proposed scheme allows estimation of different spectral/order components of the signal and, in addition, estimates...

متن کامل

Design and Applications of In-Cavity Pulse Shaping by Spectral Sculpturing in Mode-Locked Fibre Lasers

We review our recent progress on the realisation of pulse shaping in passively-mode-locked fibre lasers by inclusion of an amplitude and/or phase spectral filter into the laser cavity. We numerically show that depending on the amplitude transfer function of the in-cavity filter, various regimes of advanced waveform generation can be achieved, including ones featuring parabolic-, flat-topand tri...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016